Accelerating Disease Gene Identification Through Integrated SNP Data Analysis
نویسندگان
چکیده
Information about small genetic variations in organisms, known as single nucleotide polymorphism (SNPs), is crucial to identify candidate genes that have a role in disease susceptibility, a long-standing research goal in biology. While a number of established public SNP databases are available, the specification of effective techniques for SNP analysis remains an open issue. We describe a secondary SNP database that integrates data from multiple public sources, designed to support various experimental ranking models for SNPs. By prioritizing SNPs within large regions of the genome, scientists are able to rapidly narrow their search for candidate genes. In the paper we describe the ranking models, the data integration architecture, and preliminary experimental results.
منابع مشابه
Rapid Detection of Rare Deleterious Variants by Next Generation Sequencing with Optional Microarray SNP Genotype Data
Autozygosity mapping is a powerful technique for the identification of rare, autosomal recessive, disease-causing genes. The ease with which this category of disease gene can be identified has greatly increased through the availability of genome-wide SNP genotyping microarrays and subsequently of exome sequencing. Although these methods have simplified the generation of experimental data, its a...
متن کاملSNP Marker Assisted Selection for Identification of Fusarium Resistant Melon Plants
Melon is an important crop cultivated in moderate climate regions of the world. One of the most important diseases of this plant is vascular wilt caused by Fusarium oxysporum f.sp. melonis (Fom). Infection of farm by this pathogen can result in huge damage around the world. Development of resistant varieties is the most effective method for disease control. Four races of 0, 1, 2 and 1,2 have be...
متن کاملSingle Nucleotide Polymorphism (SNP) in the Adiponectin Gene and Cardiovascular Disease
Dear Editor, The recent article by Mohammadzadeh et al.[1] on the latest issue of this Journal showed that the T allele +276G/T SNP of ADIPOQ gene is more associated with the increasing risk of coronary artery disease (CAD) in subjects with type 2 diabetes. Adipocytes were described in myocardial tissue of CAD patients and their role recently discussed[2,3]. Susceptibility to CAD by polymorp...
متن کاملPATIKA: an integrated visual environment for collaborative construction and analysis of cellular pathways
MOTIVATION Availability of the sequences of entire genomes shifts the scientific curiosity towards the identification of function of the genomes in large scale as in genome studies. In the near future, data produced about cellular processes at molecular level will accumulate with an accelerating rate as a result of proteomics studies. In this regard, it is essential to develop tools for storing...
متن کاملIdentification of gene-gene interaction using principal components
After more than 200 genome-wide association studies, there have been some successful identifications of a single novel locus. Thus, the identification of single-nucleotide polymorphisms (SNP) with interaction effects is of interest. Using the Genetic Analysis Workshop 16 data from the North American Rheumatoid Arthritis Consortium, we propose an approach to screen for SNP-SNP interaction using ...
متن کامل